Survey of Distributed Computing Frameworks for Supporting Big Data Analysis

Abstract

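Distributed computing frameworks are the fundamental component of distributed computing systems. They provide an essential way to support the efficient processing of big data on clusters or the cloud. The size of big data increases at a pace that is faster than the increase in the big data processing capacity of clusters. Thus, distributed computing frameworks based on the MapReduce computing model are not adequate to support big data analysis tasks, which often require running complex analytical algorithms on extremely big data sets in terabytes. In performing such tasks, these frameworks face three challenges: computational inefficiency due to high I/O and communication costs, non-scalability to big data due to the memory limit, and limited analytical algorithms because many serial algorithms cannot be implemented in the MapReduce programming model. New distributed computing frameworks need to be developed to conquer these challenges. In this paper, we review MapReduce-type distributed computing frameworks that are currently used in handling big data and discuss their problems when conducting big data analysis. In addition, we present a non-MapReduce distributed computing framework that has the potential to overcome these challenges.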

Similar resources

Distributed Data Processing Frameworks for Big Graph Data

Recently we create so much data (2.5 quintillion bytes every day) that 90% of the data in the world today has been created in the last two years alone [1]. This data comes from sensors used to gather traffic or climate information, posts to social media sites, photos, videos, emails, purchase transaction records, call logs of cellular networks, etc. This data is big data. In this report, we fir...

Advanced Visual Interfaces Supporting Distributed Cloud-Based Big Data Analysis

Handling the complexity of relevant data requires new techniques with regard to data access, visualization, perception, and interaction for innovative and successful strategies. As a response to increased graphics performance in computing technologies and Information Visualization, Card et al. developed the Information Visualization Reference Model. Due to further developments in Information Sy...

A Survey of Statistical Methods and Computing for Big Data

Big data are data on a massive scale in terms of volume, intensity, and complexity that exceed the capacity of standard software tools. They present opportunities as well as challenges to statisticians. The role of computational statisticians in scientific discovery from big data analyses has been under-recognized even by peer statisticians. This article reviews recent methodological and softwa...

Security Methods for Privacy Preserving and Data Sharing Over Cloud Computing and Big Data Frameworks

The cloud computing is one of the widely used services for resource management by many IT (information technology) and non-IT organizations due to its different benefits in terms of time saving and cost savings to the companies. Such cloud computing frameworks are used to store the small to big data efficiently. Most of companies want to store huge amount of data and hence along with cloud comp...

Big Data with Cloud Computing: an insight on the computing environment, MapReduce, and programming frameworks

The term ‘Big Data’ has spread rapidly in the framework of Data Mining and Business Intelligence. This new scenario can be defined by means of those problems that cannot be effectively or efficiently addressed using the standard computing resources that we currently have. We must emphasize that Big Data does not just imply large volumes of data but also the necessity for scalability, i.e., to e...

Journal

Journal title: Big Data Mining and Analytics

Year: 2023

ISSN: 2096-0654

DOI: https://doi.org/10.26599/bdma.2022.9020014